Rank | Count | Beginning |
---|---|---|
1429 | 801 | އަދި |
725 | 429 | އެ |
726 | 308 | މި |
4828 | 270 | އެކަމަކު |
4825 | 256 | ނަމަވެސް |
1469 | 208 | އޭނާ |
1432 | 169 | އެއީ |
1464 | 128 | އޭގެ |
1431 | 127 | މިއީ |
3301 | 122 | އެހެން |
1657 | 87 | މީގެ |
7515 | 84 | އެހެންކަމުން |
4827 | 65 | އެގޮތުން |
4826 | 59 | އެހެންވެ |
1428 | 55 | ދެން |
3411 | 43 | މިދިޔަ |
3386 | 41 | ރައީސް |
3300 | 40 | މިހާރު |
3389 | 40 | އެންމެ |
3549 | 39 | ބައެއް |
5283 | 36 | ރާއްޖޭގެ |
7979 | 35 | އަޅުގަނޑުމެން |
38 | 33 | "މި |
3328 | 33 | އެކަން |
3325 | 30 | އޭނާގެ |
4963 | 30 | ފުލުހުން |
774 | 29 | ގެ |
6623 | 29 | މިނިސްޓަރު |
6757 | 29 | ފުލުހުންގެ |
4843 | 28 | އަހަރެން |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV